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ABSTRACT 

A new approach to the study of the large-scale stellar cluster distribution in the Galaxy 
based on two-point correlation techniques is presented. The basic formalism for this 
method is outlined and its applications are then investigated by the use of a simple 
model of cluster distribution in the Galaxy. This provides an estimate of the potentials 
of the two-point correlation function for indicating clustering in the measured star 
positions, which can be related to the presence of star clusters in the observed field. 
This technique is then applied to several areas of the Two Micron Galactic Survey 
catalogue, from which information is obtained on the distribution of clusters according 
to position in the Galaxy, as well as about age, density of stars, etc. 

Key words: stars: statistics - infrared: stars - open clusters and associations: general 
- Galaxy: structure 



1 INTRODUCTION 

Open star clusters provide valuable information on the evo- 
lution of the Galaxy. In this paper two-point correlation 
techniques are used to analyse the distribution of open clus- 
ters in order to gain an insight into the structure and evo- 
lution of the Galaxy. 

Open-cluster distributions have been widely studied at 
optical wavelengths as a means of studying Galactic struc- 
ture and evolution (see, for example, Lynga 1987b; Janes & 
Phelps 1994) . The Lynga catalogue of open clusters (Lynga 
1987a) lists about 1200 clusters, which represent nearly all 
the open clusters accessible in the visible. Knowledge of the 
positions and ages of these clusters (a method of age de- 
termination for clusters is given by Carraro & Chiosi 1994) 
enables the scale length and scale height of the disc to be 
derived for both young and old clusters (for a review of old 
open clusters, see Friel 1995) and theories to be developed 
on their formation and destruction throughout the history 
of the Galaxy. 

The limitations on these studies are imposed by the 
maximum distance at which open clusters can be detected. 
Most of the cataloged open clusters are in the solar neigh- 
bourhood, and very few have distances greater than 3 kpc 
(Payne-Gaposchkin 1979). Hence, information is obtained 
only for a small region of the Milky Way. The problem is 
caused by interstellar extinction. An excellent tool for study- 
ing star clusters and star formation regions is to observe 
in the infrared (Wynn- Williams 1977), where the effect of 
the extinction is far less. To date, however, the infrared has 



been little used in this field owing the absence of suitable 
databases. 

The K band is probably the best region of the spectrum 
for tracing the stellar distribution of the Galaxy. The K 
radiation is a mass tracer in spiral galaxies because it follows 
the old stellar population (Rhoads 1995). Furthermore, the 
K light is dominated by high-mass stars in star formation 
regions, i.e. in open clusters, so it is especially appropriate 
in the search for young clusters, which are rich in massive 
stars. 

As explained in more detail by Garzon et al. (1993), the 
Two micron Galactic survey (TMGS) is a 7^-band survey of 
various regions along the Galactic equator between —5° < 
I < 35°, |6| < 15° and 35° < I < 180°, |6| < 5°. The TMGS 
catalogue has a positional accuracy of about 4 arcsec in right 
ascension and 7 arcsec in declination. These errors have been 
estimated after cross-correlating the original TMGS source 
positions with Guide Star Catalogue (GSC) counterparts. 
The larger error in declination comes from the orientation 
of the array with respect to the survey direction. Due to the 
dead spaces between detectors, the K limiting magnitude 
for completeness has to be set conservatively within a range 
from 9 to 9.5 mag, although the detection limit magnitude 
of the survey is well in excess of 10 mag. 

In this paper a new method is presented for automat- 
ically determining the level of clustering in catalogues, the 
TMGS being used as an example. A set of criteria are de- 
fined for an automatic search for correlations among stars 
by means of the two-point correlation function and the two- 
point angular correlation function. A simple model which 
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assumes a regular distribution of clusters with a constant 
star density is developed in Section 3. Predictions from this 
model are then compared with the TMGS and the discrep- 
ancies analysed. The causes of clustering are then discussed. 
The use of the tools described in Section 2 and their appli- 
cation to the TMGS catalogue are dealt with in Section 5, 
and the results for several regions of our Galaxy are given 
in Section 6. Finally, a summary of the results is given, and 
suggestions are made for future developments of the method- 
ology described here. 



2 THE TWO-POINT CORRELATION 
FUNCTION AND THE TWO-POINT 
ANGULAR CORRELATION FUNCTION 

Occasionally, when the scale length of a system is changed, 
certain aspects of the system remain invariable, as in the 
case for the distribution of matter in space. For example, 
there are mathematical methods for handling the spatial dis- 
tribution of atoms in solids, gases and (particularly) liquids. 
Cosmologists face the same kind of mathematical problem 
when working with the distribution of galaxies and clusters 
of galaxies in the context of the large-scale structure of the 
Universe. They treat the Universe as a fluid whose 'particles' 
are galaxies. Our aim in this paper is to develop the use of 
similar mathematical methods on an intermediate scale, i.e. 
in examining the distribution of the stars that make up our 
Galaxy. 

Correlation functions describe how points are dis- 
tributed (e.g. Peebles 1980; Borgani 1995). Suppose a local 
density of objects SN/SV = p(f) and an average density (p) 
(hereafter, the symbol (■ ■ ■) indicates a local volume aver- 
age). Note that p corresponds to the exact distribution of 
objects, i.e. it is a Dirac delta function with zero value where 
there are no objects, and (p) is the local average density, i.e. 
the number of objects per unit volume, and provides no in- 
formation concerning their distribution. 

The two-point correlation function (TPCF) is defined 

as 



a\r~r\) 



(p{r))* 



(1) 



The function £(r) expresses the excess over the ran- 
dom probability of finding objects at separation r. (£(r) = 
means that the probability is totally random; £(r) > that 
the probability is greater than random, i.e. that there is 
clustering; and £(r) < that the probability is less than 
random, i.e. that there is relative avoidance). 

In the same way, the two-point correlation function can 
be defined for two dimensions on the surface on to which 
the distribution is projected (the celestial sphere in the case 
considered here). This is called the two-point angular corre- 
lation function (TPACF) and is defined as 



uj(\e-6'\) = 
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where a is the surface density per unit solid angle. 

Another mathematical technique for deciding whether 
a distribution is non-Poissonian is area tessellation, as was 
used by Balazs (1995) to test the grouping tendency of H a - 
emission stars in the Orion molecular clouds without giving 



a quantitative measure of the departure from the Poisso- 
nian distribution. See also Pasztor et al. (1993), Pasztor & 
Toth (1995) and references therein for other astronomical 
applications of spatial statistics. 



2.1 Relationship between the TPCF and the 
TPACF for stars 

When applying the above definitions to stars in the Galaxy, 
the luminosity function and space density have to be taken 
into account. 

By generalizing the result of the Limber (1953) equa- 
tion for constant density, the relationship between the TPCF 
(which is non-zero for distances less than Ar) and the 
TPACF (for small 8) for any density distribution is 
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where r is the distance along the line of sight, M the absolute 
magnitude, <f>(M) the luminosity function and 
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The minimum and maximum values of M for a distance r 
depend on the minimum and maximum values of the appar- 
ent magnitude and the extinction along the line of sight. 

In this case, it is assumed that the absorption is not 
patchy, i.e. that it is independent of 6 for small angles. This 
is not exactly true but it will be show in Section 2.4 that 
the effects are negligible. 

The subscript 't' stands for 'total', a projection of 
all distances and magnitudes, and (a t ) is the total two- 
dimensional density for all distances and magnitudes. In the 
literature, {at) is also called A(m m in, m max , I, b) and repre- 
sents the star counts in the magnitude range (Bahcall 1986). 

This expression enables the TPACF to be found once 
the three-dimensional distribution of the stars is known and 
forms the basis of this article, in which we create a model 
distribution of the stars and compare the results obtained 
with those observationally in order to investigate the distri- 
bution of clustering in the structure of our Galaxy. 

In general, the TPACF cannot be inverted to give the 
TPCF due to the multiplicity of possible solutions and to the 
lack of precise knowledge of certain parameters. However, 
there are certain cases in which the equation can be inverted 
and TPCF obtained from TPACF (Fall & Tremaine 1977). 
A trivial example where inversion is possible is that of a 
Poissonian three-dimensional distribution, which implies a 
Poissonian projected distribution and vice versa, i.e. £ = 0, 
uj — on all scales. Another example is when (p)(r) is a 
constant independent of r. 
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2.2 Definition of new variables 

In order to simplify the comparison of the level of clustering 
for different regions of the sky, two new variables will be 
introduced. 

#max is defined as the first zero of In this article 

(see for example Fig. [?]), Lu t is positive up to a separation 
#max- For values greater than # ma x this value is small and 
oscillates about zero, as there is no correlation among stars 
separated by large angular distances. 

Another definition, corresponding to the integration of 
uJt up to the limit 6 = # max {0 > # ma x would give a null 
contribution to the integral), is 
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which means the excess (when C2 is positive) or deficit 
(when C2 is negative) of the relative number of objects with 
respect to a Poissonian distribution in a circle centred on an 
arbitrary star on the celestial sphere, within the observed 
solid angle and with angular radius 9 mlx . The relative cor- 
relation within the angular scale # max is therefore measured. 
We call C2 the 'accumulation parameter' (N.B. there are also 
other definitions in the literature of the TPACF integral, e.g. 
Wiedemann & Atmanspacher 1990). 

The variable C 2 has a clear meaning associated with 
projected clustering and is also a useful number to measure. 
Since it sums several values of ui for different angles, it con- 
denses the information of interest into a single number that 
can be compared for different samples of stars and give the 
degree of clustering. This parameter is a mathematical ex- 
pression of the degree of clustering seen in fields of stars. 
The idea that we wish to stress here is that all mathemat- 
ical developments described in this paper are designed to 
put the intuitive idea of clustering to a reliable test. These 
calculations are necessary for a quantitative, as opposed to 
a merely qualitative, description of clustering. 

Applying the expression (^) of uj t in C2, we get 
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where 2, an integrated TPCF, is 
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2.3 Further approximations 

In order to simplify the above calculations, it will be as- 
sumed that the distribution of stars does not depend on 
their luminosity, i.e. that £(y;r, M, M') = £{y;r). This is 
not completely true as there is a small dependence on the 
distribution of stars in a cluster according their masses, and 
the luminosities are dependent on the masses. A complete 
calculation taking the luminosity function into considera- 
tion would be of great value. However, the relationship be- 
tween the TPCF and the luminosity function is uncertain, 



although the effects of this approximation for the detection 
of clusters are expected to be small. 

With this approximation, and from (Q) and 
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where (N*(r)) is the number of stars observed per unit solid 
angle at a distance r: 

(N*)(r) =r 2 (p)(r) dM <f>(M). (10) 

J WminM 



The variable uit can also be expressed as 



wt(0) = E(r») 

where the averages r and S are such that match (| 
Also, from (|fj), 
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This last equivalence is a way of averaging the function £. 

Hence, high values of C 2 indicate that there must be 
high projected clustering in the direction of the beam. 

2.4 Patchiness of extinction 

It is clear that extinction can distort the observed counts, 
the amount of the distortion being a matter of controversy. 
It is generally accepted that in the optical wavelengths this 
influence is very severe, particularly in regions near to or 
in the Galactic plane in the inner Galaxy, where the strong 
and patchily distributed obscuration makes it difficult to 
penetrate deep into the Galaxy. The amount of extinction 
decreases substantially with increasing wavelength. Maihara 
et al. (1978) quoted a value of 0.17 mag kpc -1 as typical for 
extinction in the Galactic plane in the K band, compared 
with 1.9 mag kpc" 1 for the V band (Allen 1973). 

This has two important consequences. First, the K band 
is more effective at penetrating the interstellar dust. Sec- 
ondly, the observed stellar distribution more closely resem- 
bles the true distribution. For the second argument to be 
true it is necessary that the obscuration in the K band 
should not only be smaller in amount than in the V band, 
but also that its patchiness should be less important. 

This rather uniform distribution of the interstellar ex- 
tinction in K can be inferred from the TMGS histograms in 
several cuts across the Galactic plane. Garzon et al. (1993, 
their Fig. 8) compared the observed stellar distribution in 
the TMGS and the GSC in the V band. It is noticeable how 
uniform the K histograms are, particularly when compared 
with those for the GSC. Except for small portions highly 
concentrated in the Galactic plane and more marked in the 
central regions, the shape of the high spatial resolution dis- 
tribution curves of the TMGS does not exhibit the 'noisy' 
pattern of the GSC plots, which is certainly due to the pres- 
ence of strong and patchily distributed extinction. 

Hammersley et al. (1994) showed similar histograms for 
different areas which also have similar shapes. Moreover, a 



4 M. Lopez- Corredoira et al. 



good fit to a classical exponential disc can be seen in Fig. 
3 of that paper; this would not be the case if the extinction 
were important and non-uniform. 

This conclusion can also be reached from the contour 
maps of the bulge of the Galaxy of Dwek et al (1995), who 
showed the residuals of the DIRBE data after disc subtrac- 
tion and extinction correction. Again, the general shape of 
the maps proves the basic uniformity of extinction distribu- 
tion in the near infrared. 

We now estimate these effects. From (Q) with the change 
of variable r = lQ( 6 +»»m«- M >»«)/ 5 an d 

/A/max /-M max 
dM<t>{M)& j dM 4>(M), (13) 

the local cumulative counts <r t follow the expression 
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ignoring the variation of extinction with the distance. If we 
take the density D as constant, then 

3m max /5 



N(m n 



oc 10° 
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Taking D as constant is sufficient for estimating the the or- 
der of magnitude of the patchiness due to extinction. In any 
case, the above proportionality is followed in the observed 
cumulative counts but with a constant value of between 1 
and 2 instead of 3/5 in the exponent. 

An excess of extinction, Aa(9), due for example to a 
cloud at an angular distance \9 — 9q\ with respect to a given 
point 9o, will cause a reduction in the apparent flux of a 
fraction, /, of stars (behind the cloud), thereby creating the 
same effect as a reduction in the maximum apparent magni- 
tudes of these stars by Aa(9) mag, or, if Aa(9) is relatively 
small, a reduction in m max by fAa{ff) mag for all stars. 
Hence, 



<r t (0) ~ a t (6lo)10- 3(a(( ' ) - a(flo))//5 . 



(16) 



If it is assumed that the observed flux fluctuations, 
AF, are due mainly to extinction variations, with the small- 
fluctuation approximation, then both are related by 



Aa = -2.5 log 



1 AF 
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(the factor / appears again here for the same reasons as 
above). So, from equation (|l6|), using the small- fluctuation 
approximation, 



at(0) 3 F{9) 
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This means that the angular correlation of star density is 
about 3/2 times the angular correlation of the flux. 

Averaging the DIRBE K flux (Boggess et al. 1992) fluc- 
tuations from the maps with 2520" resolution over | b |< 3° 
for constant-Z strips over the range —35° < I < 35° (where 
the effects of extinction are most relevant), we get root mean 
squares of 



0.03 < cr(AF/F)i =coa3t . M < 3 ° < 0.23, 
with an average of 



(19) 



The oscillations of flux fluctuations are not very high in the 
plane, their maximum being 2.3 times the average. 

From equation (|l^), and taking into account that the 
root mean square is yJui{Q), 



>(0)i= 



const. ,|6|<3° 



0.015 
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for regions of 2520" in size. In the most unfavourable case, 
where the extinction is highest (multiplied by a factor of 
2.3 2 because the maximum root mean square is 2.3 times 
greater than the average), w(0) ~ 0.08. 

Higher-resolution flux maps are not available in the K- 
band for the whole sky so we cannot derive these numbers for 
smaller scales, but they are not expected to be much higher 
since average cloud size is of the order of degrees (rather 
higher than 42') and the cloud distribution is fairly smooth. 
A fractal distribution would increase the contamination but 
this may apply only to very cold gas clouds (Pfenninger & 
Combes 1994) which are not the main cause of extinction in 
the K band. 

We conclude that extinction in K cannot be responsible 
for correlations w(0) greater than ~ 0.08. This is just an 
estimate, but the order of magnitude should not be very 
different. As will be shown, the results when applied to the 
TMGS are above this value (see, for example, Fig. Q), and 
causes other than patchy extinction must explain this. 



3 A SIMPLE CLUSTERING MODEL 

In order to gain an understanding of how the accumulation 
parameter (C2) varies with Galactic position, a model of 
stellar distribution is required. The model to be adopted in 
this section is very simple and consists of a group of spher- 
ical star clusters separated by distances much larger than 
the sizes of the clusters and embedded in a Poissonian dis- 
tribution of field stars with average density p nc . The density 
of clusters is (n c \). To simplify the problem, a constant star 
density, p c \, is assumed for each cluster (p nc < p c \); it is also 
assumed that the clusters have the same radius, R c i (i.e. 
they are filled homogeneous spheres). In fact, the density of 
clusters, their radii and their internal stellar densities are 
different in different regions of the Galaxy, however there is 
insufficient information to construct a more detailed model, 
and our main interest here is in how Ci varies qualitatively. 

According to the definition, the TPCF is the average of 
the product of two numbers (for a distance y): the first is 
the probability of finding an object in a given position, and 
the second is the number of times that the object counts 
in a shell of radius between r and r + dr exceeds the same 
counts in a Poissonian distribution. The first number, the 
probability of finding an object in a volume dV centered 
on f within a total volume V, is p(r)dV/{p)V . With regard 
to the second number (Fig. |l|), there are two cases to be 
considered: 

(i) When r is at distance x < R c i + y from the cen- 
tre of a cluster, the second number is the sum of 
the excess (with respect the average, i.e. (p)) of ob- 
jects in the part of the shell that is inside the cluster 
{S ss (x;y,R c \)/A-Ky 2 x (p c] - (p))/(p)) minus the deficit 
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Figure 1. Intersection of two spheres. 



of objects in the part of the shell outside the cluster 
{(4-Ky 2 - S ss (x;y,R cl ))/^y 2 X «p) - p ac )/(p». Here 
S ss (x;y,R c i) is the area of a spherical surface of radius 
y inside another sphere of radius R c \ whose centre is at 
distance x from the first one. 
(ii) When the distance, x, from any cluster is larger than 
Rci + y, the second number is the negative quantity 
~({p) ~ pnc)/(p), i.e. the deficit of objects compared 
with a Poissonian distribution with density (p) . 

When y is sufficiently large there will be cases in which 
the shell intersects more than one cluster. However, such 
large values of y are of no interest here, and only those cases 
in which the magnitude of y is of the same order as that of 
R c \ will be considered. 

Thus, the expression for the TPCF is 



U '/! a, pi" / dx x 2 ^ 



S aa (x; y, R V){pc\ ~ (p» - (47n/ 2 - S ss (x; y, R cl ))((p) - p n c) 
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2 P nc 



S ss (x;y,RJ)(p c i - (p>) - (4ttj/ 2 - S ss (x;y, R cl ))((p) - p nc ) 



1 - {n cl )-TT(R cl +y) 3 



47ry 2 (p) 
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The average density is 

4 3 / 4 3 \ 
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Also, from the appendix AI: 



(26) 
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-ttR 3 v 2 + — f-K-Y 



j/ < 2R cl 
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and 



da; x 2 S ss (x;y,Rc 



URlxv 2 , 



3 „,2 / 3 _y 1 / y 

4 R cl 10 [ R c 



y < 2R cl 
y > 2Rd 



(28) 



We insert the last five equalities in ( J22| ) and, after sim- 
plifying, this leads to: 



<™cl> 



P c l - Pnc 
. (P> 



£(») = 



- 3 TH cl (n. cl ) - J7I — + TO (77^-J 

< "->( ££ ii ££ ) 2 ^ B ci(^^<"oi>) 



V < 2-R cl 
» > 2R cl 



} (29) 



The reader should bear in mind that the applicable 
value of y which is that for distances smaller than the mini- 
mum distance between two clusters. Due to the properties of 
the TPCF, the quantity f M ^ dV y t{y) = 4tv /" dy y 2 ^{y) 
should be equal to zero. This is not exactly the case in 
([29|), which is not valid for y as large as the typical dis- 
tances among clusters. Nevertheless, it can be seen that 
f v _(„ }-i dV y (;(y) = 0, i.e. the volume in which ( pj| ) can 
be used is roughly (n c i) -1 . 

The behaviour of the function can be seen in the Fig. 
^| for (rici)^! = 10 -3 , pel/ pnc = 100. It becomes constant 
for y > 2R C \, and the value in which £ = is y — 1.90J? c i, 
which also is very close to 2R C \. In what follows, the term 
TjirRcifoci) will be neglected because in practice it is too 
small (the separation among clusters is much larger than 
Rd). 

The next step is the calculation of the integrated TPCF 
(S) using ([?]) and including (|2e|). In this case, Ar is the first 
value that follows £(\/y 2 + (r — r') 2 ) = 0, which is Ar = 
1 2 . Hence, 



■(•- 



+ (r - r'Y) 



v \ 8 16 



and 



Fig. |2| shows that the behaviour of the TPCF and the 
integrated TCPF (£ and H) are not very different. 

Now, using (|^), the observed value C2 will be calculated. 
The theoretical lower limit of the integral should be zero, but 
in practice values under a j/ m i n = r# m i n cannot be observed, 
due to the resolution 9 m i n of our detector (r is the distance 
of the zone of clusters) . 
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Model with clusters of constant density 

<n d >R cl 3 =1 ; pyp^lOO 
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Figure 2. Two-point correlation function, £, and integrated two- 
point correlation function, S, for a simple model. 
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where F(to) is 



2R C 



(31) 



F(t ) = 24 / d* t 



/■ 

■7t„ 



v \8 16 / 



(F(0) = 1, f(l) = 0). 

The observational value of C2 would be an average of 
this quantity with weig ht (N*) 2 (r), according to (Q, and 
with the characteristic parameters of the cluster depending 
on r, i.e. (n c i)(r) and - Pnc)/(p}) 2 (r). 



3.1 The contribution of a single shell of 
clusters 

In the case where there are clusters only in the shell between 
r and r + Sr, with Sr/r <C 1, and where the contribution 
to C2 is given only for this range of r (when in the other 
ranges of r the contribution is nil, or when the other shell 
with clusters has a negligible (N*)(r)), then 



2i? c 



(33) 



because the TPCF (£) in (|29|) is zero for this value (as 
f vriffi (n c i) is much smaller than unity). Then, introducing 



(|33j) in ( |3l| ) and feeding the result into (|12|), we obtain 

{N*) 2 (r)5r ^ ( p cl - p ac \U 3 



RdF 



( 0m in 
V $max 



(34) 



The factor F reduces the value of C2 when # max is only 
few times greater than 9 m in- Since # max = 2R c \/r, the more 
distant the clusters the smaller the value obtained for Ci. 
#max is normally enough large compared with 6 m i a for F to 
be considered as always close to unity. The values that F 
takes are shown in Table |l| 



Table 1. Correction factor F due to the finite resolution of the 
detector. 



X 


F(x) 





1.000000 


0.1 


0.956356 


0.2 


0.838143 


0.3 


0.671930 


0.4 


0.487970 


0.5 


0.314330 


0.6 


0.172488 


0.7 


0.074429 


0.8 


0.021008 


0.9 


0.002144 


> 1 






As can be seen, when x is greater than ~ 1/4, i.e. when 
the distance of the cluster is greater than ~ i? c i/2# m i n , the 
effect of the factor F begins to be noticeable. 

3.2 Clusters distributed throughout the 
Galaxy 

Suppose that there are clusters distributed throughout the 
Galaxy, i.e. that there are clusters at all distances along the 
line of sight in any direction. The TPACF (u>t), which is re- 
lated to the TPCF (£) through (|), is computed numerically. 
The function £(y) is obtained with (^), where it is assumed 
that the size and star density in the cluster is the same in 
all clusters, and that also the density of clusters, (n c i), is 
proportional to the mean density of stars: 



(n cl )=C(p). 



(35) 



The density of stars (p nc ) is inferred from the relation- 
ship (pi|). (N*)(r) is calculated using ([[(^), where (p) and 
4>(M) are calculated for a model Galaxy with two compo- 
nents: a disc and a bulge. The disc is taken from the Wain- 
scoat et al. (1992) model and the bulge model described in 
Lopez-Corredoira et al. (1997). The extinction law given in 
Wainscoat et al. (1992) is also used. 

The explicit dependence of C2 on I and b is calculated 
from the ui values and a # max that is derived with the ap- 
proximation 6> max « 2R c \/r, where 



J* °° dr rN* (r) 
f °°drN*(r) ' 



(36) 



#max is calculated in this way because the first zero of the 
function u> cannot be derived. In the previous approxima- 
tions, u was always positive and negative values were ne- 
glected . 
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Clusters distributed throughout the Galaxy 




150.0 200.0 



Figure 3. C 2 at b = 0°, b = 3° and b = 10° according to a 
distribution of simple clusters over all the Galaxy. 



Clusters distributed throughout the Galaxy 

i constant 




Figure 4. C 2 for different Galactic latitudes for / = 0° and 
I = 150° according to a distribution of simple clusters over all the 
Galaxy. 



When these operations are carried out for the values 
Rci = 1 pCjPci = 500 pc~ 3 and C = 10 -5 , the results shown 
in Figures H and Jm are obtained. 

In Figures g and ^] it can be seen how the correla- 
tion increases with distance from the Galactic plane (in- 
creasing |6|) as well as with distance from the centre of 
the Galaxy (increasing |/|). An explanation of this can be 
seen directly in the expression ( p^ ) , which is proportional to 
(n c i) {{pd — pnc)/{p}) 2 , where p c \ is a much larger constant 
than p nc and (n c i) is proportional to (p). It can be seen that 
£ is greater for smaller values of (p), i.e. it is greater away 
from the Galactic plane and far away from the bulge. Since 
the angular correlation function is an average of all the cor- 
relation functions in the line of sight, it produces the above 
results. 

This is merely a hypothetical example, and no mean- 
ing should be given to the actual values obtained since the 
parameters are invented. However, the qualitative depen- 
dence of u) on I and b is significant. This result is even 



more general than the particular case of the proportionality 
{n c i) = C{p). An increase in uj for increasing |/| or |fe| is given 
when (n c i) {{p c \ — Pnc)/(p}) 2 increases in a typical region in 
the line of sight. Even a dependence such as (n c i) = C{p) a 
with a < 2 is acceptable to allow this kind of dependence. 
Another significant prediction is that in the anticentre region 
the dependence with I is very smooth and nearly constant. 



4 THE CAUSES OF THE CLUSTERING 

The existence of clustering indicates that the formation of 
the stellar components of a cluster share the same time and 
place of birth, and that all the stars that are observed in 
the cluster once belonged to same originating cloud, which 
explains why they occupy a neighbouring position in space. 
The alternative hypothesis of an initial Poissonian distribu- 
tion of stars which collapsed to form the cluster is not fea- 
sible because cluster's gravity would have been too weak to 
restrain the velocities of the stars, which would have "evap- 
orated" from the primitive cluster. The fact is, then, that 
clusters originate from earlier clusters that were formed from 
a cloud, or several clouds in the same region. 

It is assumed in the TPCAF, equation (||), that only 
£, and not the extinction through the relationship between 
apparent magnitudes and ab solu te magnitudes, is dependent 
on 8. As discussed in section 



2.4, the contribution from cloud 



irregularity is irrelevant. 



4.1 Relationship with the evolution time 

The relationship of the accumulative parameter with the 
evolution time comes from the rate of evaporation of stars. 

When the effects of dust clouds are ignored, all contri- 
butions to C 2 come from £, i.e. clusters that happen to be 
in the line of sight. The relationship with the evolution time 
is through p c \ since stars escape from the cluster over time. 
According to Chandrasekhar (1942), the rate of escape of 
stars is 



Ap c i(t) 

Pel 



't e ' 



(37) 



where Q and Te are constants that depend on the character- 
istics of the cluster. Q is the rate of stars which can escape 
and Te is the average time that takes these stars to leave the 
cluster. Q is the fraction of stars with velocities greater than 
escape velocity. Chandrasekhar (1942) calculates a value of 
Q — 0.0074 for a relaxed cluster. Te (in Gyr) is the relax- 
ation time that is, for an average cluster with N stars, radius 
R (in pc) and average stellar mass m (in Mq) 



T e = 8.8 x 10" 4 y?NR?/m 



log 10 N - 0.45 



(38) 



Insofar as C 2 is proportional to ((p c i — Pnr)/(p)) 
through the proportionality dependence with £ in (|2£|) and 
assuming (n c l) and R c \ as constant and p c \ 3> (p), then C2 
may be approximated as 



C 2 {t) = C 2 {t = 0)e 



Qt 



or, expressed differently, 
Ci{t)=Ki x e' t/K \ 



(39) 



(40) 
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where K\ and K2 are positive constants. 

This is the theory for simple cases but some cases are 
more complicated. Certain other effects are not negligible, 
such as dynamical friction (see Chandrasekhar 1943a, b,c) or 
Galactic rotation and gravitational tides (Wielen & Fuchs 
1988), both of which produce different values of these con- 
stants. Spitzer (1958) points out that most open clusters 
should be destroyed by interactions with molecular clouds 
on time-scales of few hundred million years, meaning that 
few very old open clusters are known to have survived to 
the current epoch. However, iV-body simulations (Terlevich 
1987) predict that only by encounters with the most massive 
molecular clouds would the cluster be disrupted. 

Apart from theoretical considerations, exponential de- 
crease is indicated by other authors from observational data 
(for example, Lynga 1987a). Janes & Phelps (1994) fit a rela- 
tionship between the age and cluster abundance in the solar 
neighbourhood which follows a sum of two exponentials. If 
it is assumed that there is a constant creation of clusters, 
and that the death of a cluster corresponds to a low value of 
£, this would then imply that C2 is dependent on the sum 
of two decaying exponentials, although with so many effects 
it is difficult to determine which is the correct dependence. 
What is clear, however, is that high values of C2 indicate 
the existence of young clusters. 



5 MEASUREMENT OF CORRELATIONS IN 
THE TMGS AND OTHER SURVEYS 

5.1 Measurement of the TPACF 

The following discussion concerns the measurement of the 
TPACF derived from a rectangular field image of angular 
size a x 6 in a direction containing N stars of known coor- 
dinates and magnitudes (between m m i n and m max ). 

One method of determining correlation functions in a 
distribution of objects, discussed by Rivolo (1986), is to use 
the following estimator for TV points: 



(pp)( r ) 



Vi(r) ' 



(41) 



where Mj(r) is the number of particles lying in a shell of 
thickness 8r from the ith particle, and Vi(r) is the volume 
of the shell. The same applies to (era) but with areas instead 
of volumes: 



N 

(aa)(9) — — 

i= 1 



(42) 



This expression must be corrected for edge effects, i.e. 
some stars are lost in the calculation of Mi(9) when a star 
i is at distance less than 9 from an edge of the rectangular 
image. In the quantity Mi(9), the excess probability 

is measured of finding a star at an angular distance 9 from 
other stars in a ring of thickness A9 with surface area Q(9) 
and this should be proportional to Q(9) for a Poissonian 
distribution. The excess probability is reflected when there 
is an excess of stars inside the ring Q(9). The loss of stars 
due to edge effects is cause by part of the ring falling outside 
the area of rectangle. 



To solve the edge-effect problem it is necessary to calcu- 
late how many stars are lost beyond the edges with respect 
to a non-edge case. Only a fraction Fbe is measured for 
stars separated by an angular distance 9, and each Mi(9) 
must be divided by Fbe(9). The calculation of Fbe is given 
in Appendix A2 for a rectangle. This is represented by 



7r V 0/ a \7r 2 Jab 
When this is applied to (Q), it gives 



(43) 



tot(9) 



1. 



N 2 2tt9c19 [l - § (l + 



(44) 



With this simple algorithm, the angular distances of the 
stars with respect to one another (once their coordinates are 
known) and the angular correlation for a rectangular field 
of stars are obtained. 

The error in the TPACF, as for the TPCF, is derived 
by Betancort-Rijo (1991). In the limit of small A9 (the in- 
terval for calculating the different values of u)(9)} the error 
expression leads to 



l+w(0) 



(Ef=i M *W) 1 



/2 



N*2kOM [1 - f (l + f) f + (f + f - 4) -1^] 



(45) 



After this, the different integrals containing ujt, and 
their errors, can be calculated with standard numerical al- 
gorithms. 



5.2 Some examples with known clusters in 
visible 

The TPACF, as a statistical tool for inferring correlation, is 
applicable to any survey. Some test examples are given below 
in order to determine how good the method is at finding 
regions of the sky with clusters, both where there known 
to be one or two open clusters and where no clusters have 
been identified. Some open clusters were randomly selected 
from Messier catalogue and are listed in Table ^| These are 
six regions with one Messier open cluster, two regions with 
two Messier open clusters and two regions with none. Stars 
down to magnitude 12 in V were selected from the CSC. 

A square field three times larger than the catalogued 
size of the largest cluster was selected and uj t (hereafter 
called simply lj) was derived from equations (jf2|) and (^). 
The value of # ma x is derived as the angle whose u> is zero 
within the error; for larger values of 9, u> is more or less 
equally positive and negative. This criterion is not precise 
when the errors are large but it gives an acceptable estimate. 
C2 is obtained for each region from equation (^). 

Figures |a-c show the TPACF, and Table | lists 6» max 
and C2 • The correlations are positive to within the errors for 
scales shorter than the size of the clusters. Since the corre- 
lations have been calculated with stars down to magnitude 
12, many of the stars do not belong to the cluster. Because 
of this, the correlation is not excessively high, although it is 
high enough to distinguish it from the cases with no clusters 
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Table 2. Regions selected for deriving correlations. 



Objects ai9so.o(h) <5i950.o(deg) Size(') max (') C 2 



M29 


20.368 


38.367 


7 


9 


0.356±0.059 


M34 


2.647 


42.567 


35.2 


19 


0.118±0.012 


M35 


6.097 


24.350 


28 


37 


0.190±0.005 


M39 


21.507 


48.217 


32 


25 


0.049±0.004 


M45 


3.733 


23.967 


110 


143 


0.043±0.001 


M52 


23.367 


61.317 


18 


7 


0.329±0.063 


M36 & M38 


5.478 


34.950 


12 & 21 


21 


0.130±0.014 


M46 & M47 


7.615 


-14.533 


27 & 30 


21 


0.141±0.008 


None 


5.478 


36.950 




13 


0.034±0.016 


None 


7.615 


-12.583 




5 


-0.002±0.050 




O OM45 

B--HM36 & M38 
0---OM46 & M47 



1000.0 2000.0 

en 
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en 



a) 



Fig. @ b) 



Figure 5. ui at some regions with visible magnitudes. 



(Fig. qc) , which represents two random cases in regions with- 
out clusters two degrees to the north of the two fields with 
two clusters each. In the first field without clusters there is 
a small correlation but it is insignificant to within 3.5a. 

Hence, the method does indeed detect clustering where 
there are known to be clusters but not where there is be- 
lieved to be none. One cluster would be enough if its effect 
were not too attenuated by foreground and background stars 
in the chosen range of magnitudes. Moreover, the predictions 
of the sizes of the clusters is quite acceptable in comparison 
with the catalogued sizes (see Table and Figure H) . 



5.3 Peculiarities of TMGS data in calculating 
the TPACF 

Due to the way in TMGS data is obtained, the following 
considerations must be taken into account when applying 
the method and in the examination of the results obtained. 




1000.0 1500.0 2000.0 

en 



Fig. | c) 



• The method of assigning declinations to the stars in 
the TMGS will give extra angular correlations in the 
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ct=16 48 48 , 8=-22 26*40" 

Up to m K =9.0 





Figure 6. max , the predicted size of the average cluster accord- 
ing to the simple model used in this paper versus the catalogued 
size of the cluster in this region (in case of two clusters, the aver- 
age size of both is plotted). 



angles which are multiples of the angular size of the de- 
tector (17 arcsec) or multiples of the quarter diameter. 
Declinations are assigned with discrete values, which 
are separated by distance multiples of a quarter of the 
detector size (~ 4"). However, the correlation using 8 
greater than 4 or 5 arcsec is not significantly affected 
by this characteristic. The range of 9 to be used in this 
paper is from 5 to 250 arcsec; moreover, C2 is averaged 
over a wide range of 8, much higher than 4 arcsec, so 
this effect is negligible for this particular case. However, 
for the correlation of angles smaller than 4 arcsec this 
effect could be important. 
• The strips do not cover the whole sky; neither do they 
completely cover 100% of the area of the rectangles that 
we will use to calculate the angular correlation func- 
tion. The TMGS (Garzon et al. 1993) was carried out 
by means of drift scanning with strips of constant decli- 
nation, and in some cases there are small gaps between 
adjacent strips which make the sky coverage within the 
squares in the region of interest incomplete. To correct 
for this effect, we must multiply (l + u) t (8)) by the frac- 
tion of area covered in the rectangle with regard the 
expression (fli]), assuming that the positions in the rect- 
angle which are not covered is random. The fraction of 
area covered in the squares that we use is high (greater 
than 80 %) so the measure of the error is good enough 
because it is only affected by a factor of between V0.80 
and unity. 

5.4 Application to TMGS 

As an example, two cases will now be applied to the TMGS. 

From the TMGS data a rectangular region of the 
sky was selected centred on a = 16 h 48 m 48 s , 5 = 
-22°26'40" (J2000.0, corresponding to Galactic coordinates 
I = -1°58'15", b = 14°12'13") and sides 0.462° and 1.626°, 
respectively, for the directions of a and S. In this field, the 
calculation of the TPACF was carried out for stars down 
to 9.0 mag in K. From and (0), we obtain uj up to 



Figure 7. ojatl = -1°58'15", b = 14° 12' 13". 

9 = 250", as presented in Figure 0. An upper limit of 250" 
for the value of # max is considered. The parameters used in 
the measure are 9 m i a = 5" and A9 = 5". 

The result after calculating u, using is 

max = 62.5" ± 2.5" 

and 

C" 2 = 1.21 ± 0.22. 

This is an example of a field where some correlation is found 
(C2 is non-zero at the 5.5-<r level). Clearly, there is clustering 
rather than a Poissonian distribution. 

As a second example, a rectangular region was cho- 
sen centered on a = 18 h 50 m 24 s , 6 = -5°14'7" (J2000.0; 
I = 28°10'18", b = -2°8'40") with sides 0.498° and 0.289°, 
respectively, for the direction (a, 8). Calculation of the an- 
gular correlation function was also made for the stars down 
to 9.0 mag in K for this field. 

Using (^) and (g), and with the same parameters as 
for the previous case, the u shown in Fig. ^ is obtained. This 
gives finally: 



22.5 ± 2.5 



and 



C 2 = 3.0 x 10~ 2 ± 14.6 x 10~ 2 . 

In this case the correlation is very weak (C2 is ~ 40 
times smaller than in the previous case), and there is a dif- 
ference of only 0.2 a from zero, which, within the errors, 
implies that there is no correlation or clustering among the 
stars from this field. Figure ^ shows that u is almost zero 
for every value of 9 within the error bar. The value of f? max 
is meaningless in this case, because uj has such a low value 
that the error in the search for the first zero of the function 
is large. When there is little correlation in the field # max 
will be small because the algorithm that eliminates the ze- 
ros due to fluctuations does not work well when uj is much 
smaller than its error. Also, in this case a(C2), the error in 
C2, will be larger than, or of same order as, C2. To overcome 
this problem those data with large cr(C 2 ) will be separated 
and only those with C2 > 3.5 x a^Cz) will be considered as 
having confirmed correlations. 
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a=18 h 50 m 24 s , 8=-5°14'7" 

Up to m K =9.0 

1.0 | . 1 . 




Figure 8. ajatl = 28°10'18", b = -2°8'40". 



5.5 Which clusters can be detected with this 
procedure? 

In the following application, the TPACF will be measured 
for stars down to K = 9.0 mag and for a maximum angle 
of 250". From Appendix A2, this requires a minimum strip 
width of ~ 500". Larger clusters would be detected with 
larger angles; however, 250" is almost the largest angle that 
can be used if the same analytical criteria are to be applied 
to all ten TMGS strips that we will use. 

A typical open cluster has an average size of 5 pc (Janes 
& Phelps 1994), so 500" will be sufficient to detect it at 
~ 2 kpc. Therefore, mostly distant clusters will be detected. 
However, the TMGS is dominated by late K and M giants. 
Therefore, the majority of the stars detected in the magni- 
tude range to be used are significantly further away than 
2 kpc. It is estimated that ~ 10' is the maximum size of 
the clusters which will affect Ci, although this is difficult 
to calculate accurately, and the contribution of the largest 
clusters is not nil but decreases as the cluster increases in 
size. 

The exact calculation of the minimum size is also diffi- 
cult to analyse. The TMGS survey does not detect the in- 
dividual sources in globular clusters since the stars are too 
close to be separated and the whole cluster will appear as an 
extended source. Similarly, if the open cluster is very small 
there is excessive overcrowding, or confusion, of sources, a 
few of them contributing very little to the parameter Ci. The 
separation between the stars in a cluster needs to be more 
than twice the diameter of the detector for its components 
to be detected, i.e. a minimum of 30" to 35". 

In conclusion, it is expected to detect distant clusters 
with angular diameters of between 0.5 and 10 arcmin. Solar 
neighbourhood clusters, such as M45, will provide only a 
small contribution to the quantity Ci, so they will not be 
detected. Hence the analysis will focus on the large-scale 
distribution of clusters in the Galaxy. 



Figure 9. C 2 at a) <5 = -30, -22 and -16; b) 6 = -5, -1 and 
4 ; c) 8 = 22, 33, 39 (in the anticenter direction) and 30 (in the 
anticenter direction). Points with Ci > 3.5cr(Ci) are plotted with 
bar error. 



6 CORRELATIONS AS A FUNCTION OF 
GALACTIC COORDINATES 

When the procedure is used in other regions, the behaviour 
of Ci as a function of I and 6 can be determined. 

Calculations of uj{9) were carried out on several strips 
with constant declination and various sub-strips (Table W. 
The value of I quoted is that at which the strip intersects 
the Galactic plane (6 = 0°). The range of Galactic latitude 
is |&| < 15° for I < 35° and \b\ < 5° for I > 35°. 

Table 3. TMGS regions (with constant declination) used in this 
paper. 



Strip 


S 


I at b = 0° 


Width of strip 




(deg) 


(deg) 


(deg) 


1 


-29.7 


-0.9 


2.51 


2 


-22.4 


7.5 


1.63 


3 


-15.6 


15.4 


0.78 


4 


-5.2 


27.1 


0.29 


5 


-1.6 


31.2 


1.50 


6 


3.5 


36.9 


0.13 


7 


22.0 


57.8 


0.30 


8 


32.5 


70.1 


0.35 


9 


39.1 


167.8 


0.35 


10 


30.4 


178.2 


0.13 



The results are plotted in the Figs. |9| those for which 
Ci > 3.5 cr(Ci) being separated from the others. Not all the 
strips have the same number of stars, so a(Ci) is different 
for different regions. The best strips, with smallest errors, 
are those which cross the plane at I — — 1°, I = 8° and 
I = 31°, i.e. strips 1, 2 and 5 respectively. The main features 
observed are: 
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a 4.0 



O 1-27, > 3.5a 
□ 1=27, < 3.5o 
01-31, > 3.5a 
A 1=31, < 3.5a 
<l=37, >3.5a 
Vl=37, <3.5a 
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21 



0.0 □□□□□□□OT7PV1 
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So 
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b() 



Fig. | b) 



1-58, > 3.5a 
□ 1=58, < 3.5a 
01=70, > 3.5a 
A 1=70, <3.5a 
< 1=1 68, >3.5a 
V 1-1 68, < 3.5a 
>l=178, >3.5a 
3.5a 




3S: ale *= 



b(") 



Fig. I , 




(i) There is general a C'2 dependence on Galactic latitude 
in the disc for I < 90° (i.e. strips 1-8 ) with some excep- 
tions pointed out in (iv). This relationship is especially 
remarkable in strips 1, 2 and 5. The function C2{b) is 
approximately parabolic. Figure ^ shows the following 
fits to the best data (those with C2 > 3.5 affi^)) of 
these strips: 



C 2 = 2.2 x HrV for / = 0° 
C2 = 6.4 x 1(T V for I = 8° 
and 

C 2 = 20 x 10 



V + 0.0366+ 1.00 



for I = 31° 



(46) 
(47) 

(48) 



(ii) Outside the bulge, there is a general increase in C2 with 
I, as seen in equations (pj|), ( |47| ) and (Q. When the 
data with C2 > 3.5 cr{C%) are averaged between b — —3° 
and b — 3°, and also for 3° < |6| < 5°, there is a 
dependence on I as shown in Table Id 

(iii) In the inner bulge region, with / < 15° and b < 5°, 
the correlations are almost zero (Fig. ^ a)). When the 
relative correlation differences at I ~ 30° and the inner 
bulge are compared with those predicted by the simple 
model (Figs. ^, ||) a correlation deficit can be seen for 
the bulge. 

(iv) Three regions where there is an excess of correlation 
with respect to both the / and b dependence occur at 
at I = 31°, I = 37° (Fig. |b) and I = 70° (Fig. §3), i.e. 
strips 5, 6 and 8. 

(v) The anticentre region gives a correlation similar to that 
of the intermediate Galactic longitude (50° < I < 100°) 
region in the plane, or even lower (see Table ^). The 
value of C2 does not increase with I, or it does so very 
smoothly outside the intermediate / region. The simple 
model prediction is not very accurate but, comparing 
with the results in Figs. ^ and it can be seen that 
C 2 is larger at I ~ 150° than at I ~ 40°. This signifi- 
cant departure cannot be explained without including 
an extra component in the model. 



Table 4. The C2 dependence on I. 



Figure 10. Fit of C2 of strips 1, 2 and 5 (those that are greater 
than 3.5 <r(C2)) to respective parabollic functions: 2.2 X 10 — 3 t> 2 , 



6.4 X 10" 



and 20 X lO^fr + 0.036b + 1.0 (solid lines). 



Strip 


I at b = 0° 
(deg) 


C 2 in |6| < 3° 


C 2 in 3° < |6| < 5° 


1 


-1 






2 


8 


3.5 x 10~ 2 


6.9 x 10~ 2 


3 


15 






4 


27 






5 


31 


0.38 


1.29 


6 


37 


0.90 


3.01 


7 


58 


0.74 


0.97 


8 


70 


1.26 


1.08 


9 


168 


0.78 


0.71 


10 


178 


0.67 
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6.1 Causes of the dependence on Galactic 
coordinates 

(i) The b-dependence: This is predicted by the simple model 
created above. As seen in Fig. [|, lu should increase with 
\b\ because (n c i) ((p c i — Pnc)/{p}) 2 in expression ( ps| ) in- 
creases with |6|. 

(ii) The l-dependence: Again, the /-dependence can be ex- 
plained by the model. 

(iii) The bulge: When Ci values for bulge regions are com- 
pared with those for other regions where there is only a 
disc component, the observational data give a lower rel- 
ative correlation than is prediced by the model (Fig. |^), 
i.e. there is less correlation in the bulge than expected. 
This is consistent with the bulge being older than the 
disc (see Section ^l]). The bulge is known to have a 
different population of stars from the disc (Frogel 1988; 
Lopez Corredoira et al. 1997) , and these are expected to 
be an older population (Rich 1993), so the lack of cor- 
relation would be expected. Moreover, as pointed out 
earlier, the central region will have patchy extinction 
which will tend to increase the correlation. Hence, the 
C*2 values generated by clusters should be even lower 
than the observed value indicating even fewer young 
clusters in this region. 

(iv) Excess correlation in some zones in the plane: Feinstein 
(1995) points out that very young clusters are tracers 
of the spiral arms. As has been noted young clusters 
provide a significant contribution to C2. However, spi- 
ral arms are not included in the simple model, which 
contains only the disc and bulge; hence where an arm 
is crossed it is to be expected that the correlation will 
be higher than predicted. Whereas the distribution of 
old clusters varies smoothly with Galactic position, the 
young clusters have a far more irregular distribution. 
Of the four areas which cross the plane between I = 31° 
and I = 70° three show a significant correlation excess. 
The I = 31° region is almost tangential to the Scutum 
arm and also runs through the Sagittarius arm. The 
I = 37° region will also cut the Sagittarius arm. The 
excess correlation at I = 70° can be attribute to the star 
formation region in the Perseus arm. The far-infrared 
source G69C in this region has been attributed to star 
formation regions by Kutner (1987). Strip 7 (I = 58°) 
is an exception which could possibly be due to this line 
of sight missing a significant star formation region as it 
crosses the arm. 

Using the simple model des cribed previously with one 
shell of clusters (Section 3.1) but only allowing clusters 
in the arms, the degree of clustering in the arm can 
be estimated. With the use of (^) a relationship be- 
tween various cluster parameters can be obtained (for 
example, at I « 70°, b w 0°, C 2 is 1.88 ± 0.34). It is 
assumed that the greater part of the contribution to C2 
is due to the arm clusters, and that, as estimated by 
Cohen et al. (1997, in preparation) ~ 30% of the stars 
observed in that region are from the arm (with Ar w 1 
kpc in the line of sight), whereas the rest belongs to 
the disc. As 9 mal is 46.5" in this region, the equivalent 
cluster diameter, from (^), is 2.0 pc (when the dis- 
tance is r = 8.8 kpc; Georgelin & Georgelin 1976). This 
is smaller than the average value of 5 pc given by Janes 



& Phelps (1994), although this could possibly be due 
to the fact that only the core of the cluster is seen by 
the TMGS (the typical size of the core of a cluster is 1 
or 2 pc; Leonard 1988), in which there are bright and 
more massive stars; alternatively, a possible difference 
from the solar neighbourhood could also be the explana- 
tion. Nevertheless, this is only a rough estimate and no 
further conclusions should be drawn from this number. 
The order of magnitude should, however, be correct. 
With these data it possible to estimate the density of 
stars within the clusters. Using (^) gives 

-2 

(n c i) ~ 3 x 10 



(p) 



(49) 



or, making use of (Kfl) with lirR^nci) <C 1 and p nc <g 



Pel, 



(n c i) w 3 x 10 4 



|7ri?ci(n o i)/0ci + p n 



Pel 



(50) 



When (n c i) is determined with last equation, the fol- 
lowing condition is then derived for solving the second- 
degree equation with real numbers: 



Pnc < 2 x 10 >cl 

and 

5 x 10" 7 pc" 3 < (nci) < 2 x 10" 



6 -3 
pc , 



(51) 



(52) 



which is equivalent to saying that most of the stars 
within the arm in this line of sight are forming clus- 
ters. Indeed, with such a low p nc (because of (|5l|)), by 
(Eh and (§2) gives 



10 s (p) < p c i < 5xl0 5 <p), 



(53) 



which, with (p) ~ 1.4 X 10 3 pc 3 (From the Galaxy 
model used in Wainscoat et al. 1992), gives 



140 pc" 3 < p c i < 700 pc" 



(54) 



i.e. between 500 and 3000 stars per cluster, a quite rea- 
sonable number (Friel 1995 talks about a typical mass 
of young clusters of few thousand solar masses) . 
The same case is repeated at I — 37° and at I = 31°. 
When the line of sight cuts an arm, the correlation is 
greatly increased. If the arm contribution to the number 
of stars is low, the correlation will be diluted, although 
its contribution may be not totally negligible. It also 
depends on the density of the other components for that 
direction; the disc, for example, dilutes the correlation. 
Hammersley et al. (1994) and Garzon et al. (1997) sug- 
gest that there should be an excess of bright stars in 
the region at I = 27°, 6 = 0° which might be due to 
the interaction of a bar with the disc, giving rise to a 
star formation region. Concerning the deficit of corre- 
lation measured for this star formation region, it could 
be concluded that, if the star formation were sufficiently 
large-much greater, say, than the size of the rectangle 
used from sampling-then this would explain the non- 
detection of clustering in this region, 
(v) Deviation from a simple model of I dependence in the 
anticentre: Towards the anticentre (210° > I > 150°) 
the correlation is significantly less than that predicted 
by the simple model. This implies that the number of 
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clusters is small and, in particular, that very young clus- 
ters are rare in this direction. This is in agreement with 
the results from visible observations of clusters (Payne- 
Gaposchkin 1979). Janes & Phelps (1994) argue that 
there is a lack of old clusters in the inner disc, since 
they wo uld be destroyed by molecular clouds (see Sec- 
tion 4.1), but that there will a relatively large number of 



young clusters. However, the ISM density falls off with 
distance from the Galactic centre, so in the outer disc 
there will be significantly less star formation and hence 
far fewer young clusters. The existence of a gradient 
in open cluster age has been commented on by Lynga 
(1980) and Van den Bergh & McClure (1980), and an 
explanation was attempted by Wielen (1985). As has 
been noted previously, young clusters contribute signif- 
icantly to C?2, so the lack of young clusters in the anti- 
centre region would lead to a reduction in the amount 
of correlation. 

Within a few degrees of the plane the arms have a sig- 
nificant influence on the amount of correlation for the 
longitude range 30° < I < 90°. One possible reason for 
the apparent deficit in Ci towards the anticentre could 
be the excess due to the arms in the comparison re- 
gions. In order to discount this possibility, a comparison 
of the in-plane anticentre region can be made with an 
off-plane region at I — 31°. The model predicts that the 
ratio C , 2(Z=31 ,5 <|fe|<15 )/C , 2 (Z=168 ,|6|<5 ) should 
be less than unity, however, the measurements give a 
value of 4 or 5. This gives further support to the hypoth- 
esis that there are fewer young clusters than expected 
towards the anticentre. 

A further possible reason for the lack of correlation in 
the anticentre is that there could be a significant de- 
crease in the total numbers of clusters in the anticen- 
tre rather then an increase of age with Galactocentric 
distance. However, observations in the solar neighbour- 
hood (Lynga 1980; Van den Bergh & McClure 1980; 
Janes & Phelps 1994) support the hypothesis of increas- 
ing age of the clusters with Galactocentric distance. 
Also, clusters in the anticentre have a greater angular 
size because they are nearer but this is taken into ac- 
count in the model and should not cause the deficit of 
correlation. 

When the analysis presented here is applied to other 
large-area infrared observations it may contribute to our 
understandanding of this dependence on cluster age on 
Galactocentric distance. The TMGS data clearly shows 
the presence of young clusters in the inner Galaxy and 
consequently a decrease in C2 in the anticentre direc- 
tion. However, An accurate quantification is not possi- 
ble because of arm contamination in most parts of the 
regions, for which complete information is unavailable. 



1997) or 2MASS (Skrutskie et al. 1997) surveys will be ideal 
for this technique as the increased numbers of stars will re- 
duce the errors. It is even possible, with a large number 
of stars in the survey, to apply the technique for different 
ranges of apparent magnitude. Studying the clustering of 
stars at different apparent magnitudes is equivalent to do 
studying in three dimensions (I, b and the average distance 
r which is associated with the treated range of magnitudes) . 

A simple model has been developed. This model could 
be improved by introducing a density dependence as a func- 
tion of the distance from the centre of the cluster, perhaps 
a power-law dependence. 

In this paper the method has been applied to the 
TMGS. Is has been shown that a simple model in which 
old open clusters trace the whole Galaxy with a density of 
clusters proportional to the density of stars agrees quite well 
with the data. An exception to the general agreement are 
specific regions in the plane where the higher-than-expected 
clustering can be attributed to star formation in the spiral 
arms. A second departure from the simple model is the re- 
duced C2 in the outer disc and in the bulge due to a lack of 
young clusters. 

In one of the regions with an excess, at I = 70° in the 
plane, the approximate limits for the cluster density and 
the density of stars inside the cluster are derived. These are, 
respectively, 5 x 10" 7 pc -3 < (n i) < 2 x 10~ 6 pc -3 and 
140 pc~ 3 < pd < 700 pc -3 . There is, however, a lower-than- 
expected correlation at I — 27°, b = 0°. There is believed 
to be a huge star formation region in this direction and the 
lack of correlation could be due to the star formation region 
being far larger than the sample area. 

As has been pointed out by Friel (1995), the oldest open 
clusters may be viewed from two perspectives with regard 
the formation of the Galaxy: a halo collapse or a continu- 
ous accretion and infall of material from the halo on to the 
Galactic disc. Either perspective is possible. The first should 
justify which were the original star formation regions that 
were the origin of the present old clusters in the outer disc 
and how they travelled there from their place of origin. The 
second perspective needs to test the infall of matter from 
the halo as well as the existence of star systems in the halo. 
Further improvements on these cluster searches and better 
numbers will give us a hint concerning these questions on 
the origin of the Galaxy. A better determination of C2 in 
the bulge region will tell us about the age of bulge clusters 
if these exist. In this article we have observed a relative ab- 
sence of correlation in the bulge that is somewhat less than 
the prediction of our simple model, but at best the predic- 
tion could say, as in the case of the anticentre, whether the 
correlation is greater or less than the improved model and 
enable us to reach further conclusions. 



7 CONCLUSIONS 

A technique is developed for searching for clustering in stel- 
lar surveys using correlation functions. The mathematical 
tools are useful for any field of stars and can be applied to 
any survey, especially those at carried out at infrared wave- 
lengths, which permit a study of the distribution of stars 
throughout almost the entire Galaxy. The DENIS (Epchtein 
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APPENDIX Al: CALCULATIONS FOR TWO 
INTERSECTING SPHERES 

The centres of two spheres are separated by a distance x, 
and S BB (x;y, R c i) is the area of the shell of the first sphere 
(the one with radius y) contained in the second sphere 
(Betancort-Rijo 1995), with radius R c i (see Fig. [jj). There 
are various cases: 

(i) Rd>y. 

((a)) x < (R c \ — y): the first sphere is contained wholly 
within the second sphere, so that the area of the 
shell that is inside the second sphere is the area of 
the whole shell, and 



S BB (x; y, R c i) = Any 



(Al) 



((b)) (R c i — y)<x< (R c \ + y): this is the area of the shell 
for 9 up to 9o, according to Fig. [j]; simple trigonom- 
etry gives the value of #o (as a function of x, y and 

-Rci) as 



E>2 _ 2 
_rt cl — X 



COS Uo = 



2xy cos ( 



y 2 - Rl 



-2xy 



(A2) 
(A3) 



and the area of the shell up to 0q is 



S BB (x;y, R c i) = l-xy 1 



27n/ 2 (l-cos 8 ), 



d6 sinO 



(A4) 



so that, changing 9q by the value given in (A3), we 
get: 



S BB (x;y,Rci) = ^(R 2 c i^(x-y) 2 ). 



(A5) 



(ii) Rd < y. 



((a)) x < (y — Rd): here there is no contact between the 
shell and the second sphere, so 



S BB (x; y, Rd) = 0. 



(A6) 



((b)) (y — R c \) < x < (y + R c i): again, the calculation 
proceeds as for (p 



S BB (x;y, Rd) 



ny 



(Rd — (x — y) ). 



(A7) 
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Quantities that we are interested for calculating are 
J Q cI dx x 2 S BB (x;y,R cl ) and J R ^ l+y dx xx 2 S BB (x; y, R cl ). 
Again, we distinguish several cases: 



(i) y < R c v- following ( JA1| ) and (|AE|), 



dx x S BB (x;y,R cl ) 



dx x 2 4iry 2 



o 



+ 



4 



dx x 2 ^-(R 2 1 - (as - y) 2 ) 



4 R 3 2 / , 3 y 1 / y \ 



and 



da; x Ss S (a;; y, R c 



Rd+y 



dx x 2 ^-(R 2 1 - (x - y) 2 ) 



(±JL-l.(±\ l 

3 ciy U-Rci 16 \RcJ 



(ii) Rd < y < 2RcV following ( |Aq ) and ( |A7| ) 

-■Rci 

da- x 2 S BB (x;y,R c i) 



dx x 2 — (R 2 l - (x - y) 2 ) 
x 



4 
3 



„3 „,2 / -, 3 y 1 / y \ 



and 

--Rci+a 



da; x S BB (x;y,R c i) 



Rd+y 



dx x 2 ^-(R 2 1 - (x - y) 2 ) 



4 R 3 2 / 3 y 1 f y 



(iii) y > 2R C \: again, with (A6) and (A.7) 
-Kcl 

da; x 2 Sss(x;y,R c i) = 



/o 

and 

f«ol+l/ 



da; a; 5*88(3;; y, -R c 



Rr 



R c i+y 



(A8) 



(A9) 



(A10) 



(All) 



(A12) 



dx x 2 ^(R 2 cl -(x- y) 2 ) = -nRly 2 . (A13) 
x 6 



APPENDIX A2: EDGE EFFECTS IN THE 
MEASUREMENT OF THE TPACF IN 
RECTANGULAR FIELDS 

We have a rectangular surface with size a x b (x from 
to a and y from to b). A ring of negligible thickness and 



of radius 9 whose centre is located at (x, y) contains part 
of the surface inside the rectangle, f(x,y), and the rest of 
it is outside the rectangle. Due to the loss of a part of the 
ring surface outside the rectangle we measure only a fraction 
Fbb of the star counts separated by an angular distance 9. 
Assuming that the distribution of the stars in the rectangle 
is homogeneous, we have 

da; / dyf(x,y). (A14) 
o Jo 

The value of f(x, y) depends on the case. With the con- 
dition 29 < a, 29 < b (which is to be satisfied in the case 
considered here), the only posibilities are: 

(i) when 9 < x < (a — 9) and 9 < y < (b — 9): the whole 
ring is inside the rectangle, so 



Fee = \ 
a o 



f{x,y) 



(A15) 



(ii) when 9 < x < (a — 9) and y < 9: the portion of 
ring contained inside the rectangle is from the angle 
— s'm~ 1 (y/9) to it + sin~ 1 (y/9) (the value of sin _1 (---) 
between and tt/2). Then, 



f{x,y) 



n + 2sin- 1 {y/9) 



2tt 



(iii) when x < 9 and 9 < y < (b 
previous case: 

7r + 2sin~ 1 (a;/60 



f(x,y) = 



2tt 



(A16) 
similarly to the 

(A17) 



(iv) when x < 9, y < 9 and (x 2 + y 2 ) < 9 2 : the portion of 
ring inside the rectangle is from the angle — sin - (y/9) 
to tt/2 + sin~ 1 (a;/6l), St..) 

7r/2 + sin -1 (a;/6l) + sin _1 (y/0) 



f(x,y) = 



2tt 



(A18) 



(v) when x < 9, y < 9 and (x 2 + y 2 ) > 9 2 : this case is 
similar to the previous one, but we must add another 
portion of ring which is between —(ty/2 + sin _1 (a;/S)) 
and 7r + sin~ 1 (y/6'), so 



f{x,y) = 



2sin- 1 {x/9) +2sin~ 1 (y/0) 



2tt 



(A19) 



Other possible situations are avoided by the symme- 
try properties of the integral used in the evaluation of Fbe, 
which is reduced to the following calculation: 

Fhe(0) = ^[(a- 2(9)(&-20)] 



+ 



+ 



+ 



2 

a b 
2 

a b 
4 

a b 
4 

a b 



dx / dy 
Jo 

dy I dx 
Jo 



7r + 2 sin 1 (y/r) 
2~H 

7r + 2 sin~ (x/r) 



2tt 



da; / dy 
'o 



dx 



n/2 - 



l (x/r) + sin 1 {y/r) 



2tt 



= 1 - 



+ 



1 



dy 



-7r/2 + sin~ 1 (a;/r) + t 



l (y/r 



' + 



/9 TV 

U+2 



- 4 



2tt 

a b 



(A20) 



